SemanticScuttle - klotz.me » Tags: simon willison+llm

Tags: simon willison* + llm*

0 bookmark(s) - Sort by: Date ↓ / Title /

Announcing Toad - a universal UI for agentic coding in the terminal

Simon Willison discusses Toad, a new terminal coding assistant built by Will McGugan using Textual. It aims to improve upon existing tools like Claude Code and Gemini CLI by avoiding flicker and offering better interaction with terminal output. Toad is currently in private preview, available through GitHub sponsorship.

2025-07-23 Tags: open-source, markdown, simon willison, will-mcgugan, generative-ai, llms, uv, coding-agents by klotz

Using Claude Code to build a GitHub Actions workflow

The article details the author's use of Claude Code to add a feature to a GitHub repository: an automatically updated README index. It's accompanied by a 7-minute video demonstrating the process.

2025-07-04 Tags: llm, github actions, anthropic, claude, coding, agents, youtube, screencast, simon willison by klotz

Phoenix.new is Fly’s entry into the prompt-driven app development space

An article detailing Phoenix.new, Fly.io's AI-assisted app development platform built on Phoenix and Elixir. It explores the platform's capabilities, the author's experience building a notebook application with it, and its potential for expansion beyond Elixir.

2025-06-24 Tags: erlang, sqlite, fly, llm, agents, vibe-coding, simon willison by klotz

Design Patterns for Securing LLM Agents against Prompt Injections

This article discusses a new paper outlining design patterns for mitigating prompt injection attacks in LLM agents. It details six patterns – Action-Selector, Plan-Then-Execute, LLM Map-Reduce, Dual LLM, Code-Then-Execute, and Context-Minimization – and emphasizes the need for trade-offs between agent utility and security by limiting the ability of agents to perform arbitrary tasks.

2025-06-13 Tags: cybersecurity, prompt injection, llm, simon willison by klotz

Large Language Models can run tools in your terminal with LLM 0.26

LLM 0.26 introduces tool support, allowing LLMs to access and utilize Python functions as tools. The article details how to install, configure, and use these tools with various LLMs like OpenAI, Anthropic, Gemini, and Ollama models, including examples with plugins and ad-hoc functions. It also discusses the implications for building 'agents' and future development plans.

2025-05-27 Tags: llm, tools, openai, agents, plugins, python, function calling, mcp, simon willison by klotz

Building software on top of Large Language Models

A summary of a workshop presented at PyCon US on building software with LLMs, covering setup, prompting, building tools (text-to-SQL, structured data extraction, semantic search/RAG), tool usage, and security considerations like prompt injection. It also discusses the current LLM landscape, including models from OpenAI, Gemini, Anthropic, and open-weight alternatives.

2025-05-16 Tags: self-hosted, llm, embeddings, gemini, vision, tools, simon willison by klotz

Feed a video to a vision LLM as a sequence of JPEG frames on the CLI (also LLM 0.25)

This article details a new plugin, llm-video-frames, that allows users to feed video files into long context vision LLMs (like GPT-4.1) by converting them into a sequence of JPEG frames. It showcases how to install and use the plugin, provides examples with the Cleo video, and discusses the cost and technical details of the process. It also covers the development of the plugin using an LLM and highlights other features in LLM 0.25.

2025-05-06 Tags: ffmpeg, llm, vision, video, jpeg, simon willison by klotz

Understanding the recent criticism of the Chatbot Arena

An analysis of the recent paper 'The Leaderboard Illusion' which critiques the Chatbot Arena's LLM evaluation methodology, focusing on issues with private testing, unfair sampling, and potential gaming of the leaderboard. It also explores OpenRouter as a potential alternative ranking system.

2025-05-01 Tags: llm, benchmarks, openrouter, chatbot arena, simon willison by klotz

Qwen 3 offers a case study in how to effectively release a model

Alibaba’s Qwen team released the Qwen 3 model family, offering a range of sizes and capabilities. The article discusses the model's features, performance, and the well-coordinated release across the LLM ecosystem, highlighting the trend of better models running on the same hardware.

2025-04-29 Tags: llm, qwen, mlx, ollama, reasoning, qwen 3, alibaba, simon willison by klotz

Start building with Gemini 2.5 Flash

Google's Gemini 2.5 Flash model is a new, faster, and more cost-effective model with adjustable 'thinking' capabilities. The article details how to use it with llm-gemini, explores pricing differences compared to Gemini 2.0 Flash, and shares example SVG outputs.

2025-04-18 Tags: gemini, 2.5 flash, llm, google, simon willison by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: simon willison* + llm*

Linked Tags

Related Tags